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Abstract 

We consider the discrepancy problem of coloring n intervals with k colors such that at each point on the 
line, the maximal difference between the number of intervals of any two colors is minimal. Somewhat 
surprisingly, a coloring with maximal difference at most one always exists. Furthermore, we give an 
algorithm with running time 0(nlogn + fcnlogfc) for its construction. This is in particular interesting 
because many known results for discrepancy problems are non-constructive. This problem naturally mod- 
els a load balancing scenario, where n tasks with given start- and endtimes have to be distributed among 
k servers. Our results imply that this can be done ideally balanced. 

When generalizing to d-dimensional boxes (instead of intervals), a solution with difference at most 
one is not always possible. We show that for any d>2 and any k > 2 it is NP-complete to decide if such 
a solution exists, which implies also NP-hardness of the respective minimization problem. 

In an online scenario, where intervals arrive over time and the color has to be decided upon arrival, 
the maximal difference in the size of color classes can become arbitrarily high for any online algorithm. 

1998 ACM Subject Classification F.2.2 Nonnumerical Algorithms and Problems — Sequencing and 
scheduling 
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1 Introduction 

In this paper, we consider the following load balancing problem: We are given a set I = {Ii ,...,/„} 
of tasks, where each task is represented by an interval I = [£,r] El with starttime £ and endtime r. 
Furthermore, we are given k servers and have to assign the tasks to the servers as evenly as possible. 
That is, we want to minimize the maximal difference of the numbers of tasks processed by any two 
servers over all times. 

We formalize this in terms of an interval coloring problem: We are given a set 1 = {Ii, ...,/„} 
of n intervals on the real line and a set if = {1, . . . , k} of k colors. A k-coloring is a mapping 
X : I —> K. For a fixed fc-coloring \ an d a point x £ R, let Ci(x) denote the number of intervals 
containing x that have color i in \. Define the imbalance of x at x by 

imb(ir) = max \ci{x) — Cj(x)\. (1) 

In words, this is the maximum difference in the size of color classes at point x. The imbalance of x is 
given by imb(x) = max^gj^ imb(x). 

These definitions yield the following minimization problem: 

Minimum Imbalance Interval ^-Coloring 

Instance: A set of intervals X. 

Task: Find a fc-coloring x with minimal imb(x). 

We call a fc-coloring with imbalance at most one balanced. Observe that if the number of intervals 
intersecting at some point is not divisible by fc, then imbalance at least one is unavoidable. On the 
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other hand, if the number of intersecting intervals is divisible by fc, then no coloring having imbalance 
one exists. Thus, if a balanced coloring exists, its imbalance is minimal. 

As we will see shortly, it is always possible to find a balanced interval fc-coloring. Hence, we will 
mostly be concerned with its construction. More specifically, the questions considered in this paper 
are outlined as follows: 

(i) Is there always a balanced fc-coloring? 

(ii) If so, is it possible to construct a balanced fc-coloring in polynomial time? 

(iii) If we consider arcs of a circle (instead of intervals), do balanced fc-colorings always exist? 

(iv) How is the situation if intervals arrive online? 

(v) If (i-dimensional boxes (instead of intervals) are considered, can the existence of a balanced 
fc-coloring be decided in polynomial time? 

The problem has close connections to discrepancy theory; see Doerr [9 1 and Matousek [ 17] for 
introductions to the field. Let H — (X, U) be a hypergraph consisting of a set X of vertices and 
a set U C 2 X of hyperedges. Analogous to the previous definitions, a fc-coloring is a mapping 
X ■ X — > K, and the imbalance imb(x) is the largest difference in size between two color classes 
over all hyperedges. The discrepancy problem is to determine the smallest possible imbalance, i. e., 
disc(JJ) = min x:X ^ A 'inib(x). 

Hence our problem is to find the discrepancy of the hypergraph H — (I, U), where U is the 
family of all maximal subsets of intervals intersecting at some point. It turns out that this hypergraph 
has totally unimodular incidence matrix, which is useful because de Werra [22 1 proved that balanced 
fc-colorings exist for hypergraphs with totally unimodular incidence matrix. However, the proof 
in 1 22 1 is only partially constructive: A balanced fc-coloring is constructed by iteratively solving the 
problem of balanced 2-coloring on hypergraphs with totally unimodular incidence matrix, for which 
no algorithm was given in [ 22 1 . 

Further related work in discrepancy theory mostly considers hypergraph coloring with two colors 
and often from existential, rather than algorithmic perspective. For an arbitrary hypergraph H with 
n vertices and m hyperedges, the bound disc(i/) < ^/2nln(2m) for 2-coloring follows with the 
probabilistic method; see also \9\. For m > n, Spencer ll20l proved the stronger result disc(-ff) = 
0(y/n\og{m/n)), which is in particular interesting for m = 0(n). If each vertex is contained 
in at most t edges, the 2-coloring bound disc(i/) = 0(ytlagn) was shown by Srinivasan lETTl 
and the bound disc(if) < 2t — 1 by Beck and Fiala [5|. Biedl et al. [6| improved the bound 
to disc(iJ) < max{2i — 3, 2} for 2-colorings and established disc(iJ) < At — 3 for general fc- 
colorings. They also showed that it is NP-complete to decide the existence of balanced fc-colorings 
for hypergraphs with t > max{3, fc — 1} and fc > 2. 

Bansal [4| recently gave efficient algorithms that achieve 2-color imbalances similar to B01I2TI 
up to constant factors. In particular, an algorithm yields disc(i?) = 0(y/n\og(2m/n)) matching 
the result of Spencer [20] if m — 0(n). Furthermore, disc(i/) = 0(^/ilogn) complies with the 
non-constructive result of Srinivasan ETl . For general fc > 2, Doerr and Srivastav [ 10 1 gave a 
recursive method constructing fc-colorings from (approximative) 2-colorings. 

Unfortunately, these results on general discrepancy theory do not answer any of the problems 
considered here, because t is only bounded by the number of vertices. 

Our Contributions. We contribute the following answers to the above questions: 

(i) Balanced fc-colorings exist for any set I of intervals, i. e., question |i| can always be answered 
in the affirmative. We establish this by showing that our hypergraph H has totally unimodular 
incidence matrix and then applying a result of de Werra El . This also follows independently 
from our algorithmic results below. 
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(ii) We present an 0(n log n) time algorithm for finding a balanced 2-coloring, thereby establishing 
the first constructive result for intervals. Furthermore, we give an 0(n log n + knlogk) 
algorithm for finding a balanced k -coloring. This is an improvement in time complexity, since 
the construction of de Werra [221 combined with our algorithm for 2-coloring only yields 
0(n log n + k 2 n). We also note that our algorithm works for any hypergraph with incidence 
matrix having the consecutive-ones property. 

(iii) If we consider arcs of a circle instead of intervals, balanced fc-colorings do not exist in general. 
However, we give an algorithm achieving imbalance at most two with the same time complexity 
as in the interval case. 

(iv) In an online scenario, in which we learn intervals over time, the imbalance of any online 
algorithm can be made arbitrarily high. 

(v) For d-dimensional boxes, it is NP-complete to decide if a balanced fc-coloring exists for any 
d > 2 and any k > 2. Our reduction is from Not-All-Equal 3SAT. This result clearly 
implies NP-hardness of the respective minimization problem. 

2 Interval Colorings 

In this section, we consider MINIMUM IMBALANCE INTERVAL fc-COLORlNG, establish the existence 
of balanced fc-colorings, and give algorithms for 2 and k colors, respectively. Later, we consider arcs 
of a circle and an online version. 

2.1 Existence of Balanced / -Colorings 

We begin by observing the existence of balanced fc-colorings. In the proof below, we use a theorem 
of de Werra [22], but the existence of balanced fc-colorings also follows from our algorithmic results. 

► Theorem 1 . For any setT of intervals and any k € N, there is a balanced k-coloring. 

Proof. Let I be the set of given intervals. Define a hypergraph H = (I, U), where U is the family 
of all maximal subsets of intervals intersecting at some point. For H with / = ...,/„} and 
U = {Ui, . . . , U m }, the incidence matrix is defined by A = (ffl* 3 -) with a, j = 1 if Jj G Uj and 
CLi j — otherwise. 

De Werra 11221 showed that any hypergraph with totally unimodular incidence matrix admits 
a balanced fc-coloring. It is well-known [19] that a 0-1 -matrix is totally unimodular if it has the 
consecutive-ones property, i. e., if there is a permutation of its columns such that all 1-entries appear 
consecutively in every row. The incidence matrix A of H has this property: If we order the Uj in 
increasing order of intersection points, then the entries ctij — 1 appear consecutively in each row. -4 

2.2 Algorithm for Two Colors 

In this section, we present an algorithm that constructs a balanced 2-coloring in polynomial time. 
Since the algorithm produces a valid solution for every possible instance, Theorem[T]for fc = 2 also 
follows from this algorithmic result. We note in passing that a polynomial-time algorithm can also 
be obtained by solving a simple Integer Linear Program (ILP) with a totally unimodular constraint 
matrix. However, this gives a much worse running time bound of 0(n 5 / log n) Q. 

The main idea of our algorithm is to simplify the structure of the instance such that the remaining 
intervals have start- and endpoints occurring pairwise. We then build a constraint graph that has the 
intervals as vertices. Finally, a proper 2-coloring of the constraint graph induces a solution to the 
problem. 
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The start- and endpoints of the intervals are called events. A region is an interval spanned by 
two consecutive events and is called even (odd) if it is contained in an even (odd) number of input 
intervals. 

► Theorem 2. For any set I of n intervals, there is a balanced 2-coloring that can be constructed 
in 0(nlogn) time. 

Proof. W. 1. o. g. we can assume that the start- and endpoints of the input intervals are pairwise 
disjoint. If not, a new instance can be obtained by repeatedly increasing one of the coinciding start- 
or endpoints by e/2, where e is the minimum size of a region. Since the new instance includes 
a corresponding region for every region of the original instance, a balanced coloring for the new 
instance is a balanced coloring for the old instance (the converse is not true). 

Observe that a coloring that is balanced on all even regions is also balanced on all odd regions. 
This is because odd regions only differ by one interval from a neighboring even region. Thus, the task 
reduces to constructing a balanced coloring of the even regions. Since between two consecutive even 
regions, exactly two events occur, it suffices to consider only pairs of consecutive events enclosing 
odd regions. 

If a pair of events consists of the start- and endpoint of the same interval, this interval is assigned 
any color and is removed from the instance. If a pair consists of start- and endpoint of different 
intervals, these intervals are removed from the instance and substituted by a new (minimal) interval 
that covers their union. In a final step of the algorithm, both intervals will be assigned the color of 
their substitution. The remaining instance consists solely of pairs of events where two intervals start 
or two intervals end. Clearly, a balanced coloring has to assign opposite colors to the corresponding 
two intervals of such a pair, and any such assignment yields a balanced coloring. 

The remaining pairs of events induce a constraint graph. Every vertex corresponds to an interval, 
and an edge is added between two vertices if there is a pair of events containing both startpoints or 
both endpoints. Finding a proper vertex two-coloring of this graph gives a balanced 2-coloring. The 
constraint graph is bipartite: Each edge can be labeled by "h" or "H" if it corresponds to two start- or 
endpoints, respectively. Since each interval is incident to exactly two edges, any path must traverse 
h- and H-edges alternatingly. Therefore, every cycle must be of even length and hence the graph is 
bipartite. Thus, a proper vertex two-coloring of the constraint graph can be found in linear time by 
depth-first search. 

Sorting events takes 0(n log n) time. Creation of the constraint graph and coloring it takes linear 
time. -4 

Note that if intervals are given already sorted, or interval endpoints are described by small integers, 
then the above algorithm can even find a balanced 2-coloring in linear time. 



2.3 Algorithms for k Colors 

In this section, we extend the results of the previous section to an arbitrary number of colors k and 
show that a balanced interval fc-coloring can be found in polynomial time. 

A first polynomial time algorithm can be obtained using a construction by de Werra l22ll : Start 
with an arbitrary coloring and find two colors i and j for which max^ \ci(x) — Cj(x) \ is maximal. 



Use the algorithm from Section 2.2 to find a balanced 2-coloring of all intervals that currently have 
color i or j and recolor them accordingly. Repeat until the coloring is balanced. This algorithm has 
running time 0(n log n + k 2 n), because sorting intervals is needed only once for the above algorithm 
for 2 colors, and there are at most (^) recolorings necessary. 

In the following, we present an alternative algorithm for k colors, which is faster than 0(n log n + 
k 2 n). We will first give an overview of the argument, and then a more formal description. 
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As in Section 2.2 we assume w. 1. o. g. that all start- and endpoints are pairwise disjoint. The idea 
is to scan the events in order, beginning with the smallest, and to capture dependencies in fc-tuples 
of intervals that indicate pairwise different colors. That is, we reduce the MINIMUM IMBALANCE 
Interval ^-Coloring instance to an instance of Strong Hypergraph Coloring, formally 
defined as follows. 

Strong Hypergraph Coloring 

Instance: A ground set X, a family S of constraints Si, . . . , S n CI, and an integer fc. 
Task: Find a fc-coloring \ : X — > K with V I < i < n : x , y £ Si, x ^ y x( x ) x(v)- 

For example, in the special case that each block of fc consecutive events consists only of start- or only 
of endpoints, the constraints that the corresponding k intervals have to be differently colored will 
capture the whole solution. As we will see below, also different interval nesting structures can be 
captured by such constraints. 

Strong Hypergraph Coloring is NP-hard in general |1 1. However, each interval will occur 
in at most two constraints, corresponding to its start- and endpoint. Thus, we can further reduce the 
Strong Hypergraph Coloring instance to Edge Coloring, where the goal is to color edges 
of a multigraph such that the edges incident to each vertex are all differently colored. More formally: 

Edge Coloring 

Instance: A multigraph G = (V,E) and an integer k. 

Task: Find a fc-coloring y : E K with Ve, e'e£,ene'^,e^e'=> x(e) ^ x(e'). 

It is easy to see that any instance (A", S, k) of STRONG HYPERGRAPH COLORING where each element 
of X occurs in at most two constraints can be reduced to Edge COLORING as (G = (S, E), fc), 
where for each x £ X that occurs in Si and Sj with i j, we add the edge {Si, Sj} to E. For our 
case, a constraint corresponds to a vertex, and an interval corresponds to an edge that connects the 
two constraints it occurs in. An edge coloring with k colors will thus provide a balanced interval 
fc-coloring. 

Clearly, an edge coloring of a multigraph with maximum degree A needs at least A colors. 
Finding an edge coloring of minimum size is NP-hard in general [14|. However, Konig lfl6l showed 
that for bipartite multigraphs, A colors in fact always suffice. Further, an edge coloring of a bipartite 
multigraph with m edges can be found in 0(m log A) time [8 1. The multigraph we will construct has 
maximum degree k and is bipartite. Thus, a balanced interval fc-coloring always exists and can be 
found in polynomial time. 

We now describe the construction of the constraints and give a more rigorous description of 
the results. From a MINIMUM IMBALANCE INTERVAL fc-COLORlNG instance 1, we construct a 
Strong Hypergraph Coloring instance (I, S, fc) over the ground set of the intervals. The 
algorithm scans the set of events in order, beginning with the smallest. It keeps a set of active events 
and adds constraints to S. The set of active events will be cleared at each region where the number of 
intervals is modulo fc. Thus, the active events always describe the change from a situation where 
each color occurs the same number of times. 

The construction of the constraints can be visualized with a decision tree, depicted for the example 
fc = 4 in Figure [T[ At the beginning, the set of active events is empty (which corresponds to the 
root of the decision tree). Whenever the set of events is empty, the algorithm branches into two 
cases depending on the type of the next event. Both branches are equivalent, with the roles of start - 
and endpoints interchanged. Therefore, assume the next event is the start of an interval (depicted 

a 

as h on the left branch). It is added to the active set. This continues until either fc startpoints of 
intervals I\ , . . . , If. are added, or an endpoint is encountered. In the first case, the constraint 



(2) 
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a 4- 2/1 



a4-y x 



h: (abed) 



h: (abcxi) 
H: (yiyidxi) 
a 4- yx,b 4- y 2 



H: (abcxi) 
h: (yiy 2 dx 4 ) 
a <- yx,b 4- y 2 



H: (abed) 



Figure 1 Tracking of active events for A; = 4 

is constructed and the set of active events is cleared (dashed arrow returning to the root). In the second 
case, assume the startpoints of intervals Ii, . . . , Ij have been added and the endpoint of interval Ij+i 
is encountered. Then, the two constraints 



are constructed. Here, Xj+i, . . . , x k and yi, ■ ■ ■ , Vj-i are new "virtual" intervals that have not been 
used in previous constraints. That is, they do not correspond to actual intervals of the real line and 
only serve as placeholders in the active set. Furthermore, the startpoints of the intervals Ii , . . . , Ij are 
replaced by the startpoints of j/i, . . . , yj-i in the active set of events (indicated by the dashed arrows 
pointing one level higher in the decision tree). 

We now prove the correctness of the two chained reductions. 

► Lemma 3. A solution to the STRONG HYPERGRAPH COLORING instance (I,S, k), constructed 
as described above, yields a balanced k-coloring for X. 

Proof. Recall that a region is an interval spanned by two consecutive events. The proof is by 
induction over all regions, in the order of events. At each region, we count the number of times each 
of the k colors is used among the intervals containing the region. We will show that these counters 
differ by at most one, i. e., the coloring is balanced. Clearly, all counters are equal to zero before the 
first event. 

In particular, we show that in regions where the number of intervals is modulo k, all counters 
are equal, and that between these regions the counters change by at most one and all in the same 
direction. We distinguish the same cases as in the construction, limiting the discussion to the case 
that the event first added to the empty set of active events is a startpoint. 

If k startpoints of intervals 1% , . . . , are encountered, there is a constraint of the form pi, which 
ensures that all of them have different colors. Therefore, at each startpoint, a different counter 
increases by one. Hence, the counters differ by at most one in all regions up to the startpoint of Ik, 
and are all equal in the region beginning with the startpoint of Ik- 

Consider that only j < k startpoints of the intervals I\ , . . . , Ij are encountered. Since these 
startpoints were added to the set of active events during the construction, the intervals I\ , . . . , Ij do 
all occur in one constraint. This constraint forces them to have different colors, and therefore the 
colors of I\ , . . . , Ij are exactly the colors with increased count. Now, we distinguish two subcases 



(h, . . .,Ij,Xj +1 , ...,x k ) 



-,x k ) 



(3) 
(4) 
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B H : (dxixz) £) H : ( eXi x 5 ) 




"' Xj 



Figure 2 Complete example of the constraints and bipartite EDGE COLORING instance for k = 3 with a 
valid coloring. 



depending on the next event. If the next event is a startpoint of some interval, denoted by Ij+i, it 
will also be part of the same constraint. Hence, Ij + i has a different color whose count is not yet 
increased. The second subcase is that the next event is the endpoint of some interval, denoted by 
Ij+i- The interval Ij+i is forced to have the same color as one of the intervals 7i, . . . , Ij. This is 
because there is a constraint of the form <j3j> that collects all other colors in variables Xj+i, ■ ■ ■ , Xk, 
which occur together with Ij + i in a constraint of the form Q. Thus, the previously increased counter 
for the color of Ij + i decreases again. Because of the constraint of the form (BJ, the virtual intervals 
... , Vj-\ must have all of the colors of I\, . . . , Ij except for the color of Ij+i- Hence, the colors 
of yi, ... , Uj-i are exactly all remaining colors with increased count. Therefore, we are in the same 
situation as before encountering the endpoint of Ij+i- By repeating the above argument, the claim 
follows in this case. 

If the event first added is an endpoint, the argument is symmetrical. -4 

► Lemma 4. A Strong Hypergraph Coloring instance (1, S, k) constructed as described 
above can be reduced to a bipartite EDGE COLORING instance. 

Proof. We need to show that each interval occurs in at most two constraints. Once this is proved, it 
is possible to build a multigraph with the constraints as vertices and edges between them if they share 
a common interval. Further, it has to be shown that this multigraph is bipartite. To show both parts at 
once, we color the constraints in S with the two colors h and H. It then suffices to show that every 
interval can occur in at most one h-constraint and in at most one H-constraint. 

Consider Figure [2] for an illustration of the constructed constraints and the respective bipartite 
Edge Coloring instance. 

We color a constraint with h if all involved events belonging to nonvirtual intervals are startpoints, 
and with H if all these events are endpoints (see Figure[T]i. All nonvirtual intervals therefore occur in 
exactly two constraints, constructed when the start- and endpoint get removed from the active set of 
events. A virtual x-interval always occurs in a pair of subsequent differently colored constraints, and 
is not used anywhere else. For the left branch of the decision tree, a virtual y-interval occurs first in 
a H-constraint, and its startpoint is then added to the list of active events. Then, it will be used in a 
constraint of type either (|2]i or ((3), both of which are of type K The argument is symmetrical for the 
right branch of the decision tree. -4 

► Theorem 5. Every set of n intervals X has a balanced k-coloring for any k £ N, and it can be 

found in 0(n log n + kn log k) time. 
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Proof. By Lemmas [3] and [5] MINIMUM IMBALANCE INTERVAL ^-COLORING can be reduced to 
Edge Coloring with fixed k in a bipartite multigraph. The maximum degree of this multigraph is fc, 
since by construction no constraint has more than k elements. By Konig's theorem [16 | existence 
follows. 

To be able to process the events, they have to be sorted in 0(n log n) time. We have O(kn) virtual 
intervals and thus O(kn) edges in the Edge Coloring instance. Finding an edge fc-coloring for 
this multigraph with maximum degree k can be done in 0(kn log k) time 0. M 

Note that the Edge Coloring algorithm by Cole, Ost, and Schirra J8) uses quite involved 
data structures. In practice, it might be preferable to use the much simpler algorithm by Alon Q 
running in 0(m log m) time for an m-edge graph, which gives a worst-case bound of 0(kn log n) . 
An implementation of our algorithm in Python using a simple edge coloring algorithm based on 
augmenting paths can be found at http://www2.informatik.hu-berlin.de/~hueffner/intcol.py 

For an extension, recall that a matrix has the consecutive-ones property if there is a permutation 
of its columns such that all 1 -entries appear consecutively in every row. Such a permutation can be 
found in linear time by the PQ-algorithm [7 |. Given such a matrix, it is straightforward to construct 
an instance of Minimum Imbalance Interval ^-Coloring. 

► Theorem 6. For any hypergraph H with an n x m incidence matrix having the consecutive ones 
property, a balanced k-coloring can be found in 0(nm + fcnlog k) time. 

2.4 Arcs of a Circle 

In a periodic setting, the tasks I might be better described by a set of arcs of a circle rather than a set 
of intervals. In this case, there are instances that require an imbalance of two (e. g., three arcs that 
intersect exactly pairwise and k = 2). We show that two is also an upper bound and a coloring with 
maximal imbalance two can be found in polynomial time. 

► Theorem 7. The maximal imbalance for arcs of a circle is two, and finding a coloring with 
imbalance at most two can be done in 0(n log n + kn log k) time. 

Proof. Define a point on the circle, called zero, and consider counterclockwise orientation. We 
build an instance of Minimum Imbalance Interval fc-COLORiNG by "unfolding" the circle at 
zero in the following way. Consider only arcs that do not span the full circle. Map all such arcs not 
containing zero to intervals of same length at the same distance right of zero on the real line. Map 
the arcs containing zero to intervals of same length such that the positive part of the interval has the 
same length as the part of the arc in counterclockwise direction from zero. Finally, map the arcs 
containing the full circle to intervals spanning all of the instance constructed so far. Use the above 
algorithm to obtain a coloring of the intervals with imbalance at most one at every point. By reversing 
the mapping, the obtained coloring of the arcs has imbalance at most two (each point on the circle is 
mapped to at most two points of the real line). -4 

2.5 Online Algorithms 

In load balancing problems, it is often more realistic to assume an online scenario, where not all 
information is known in advance, but is rather arriving piece-by-piece, and irrevocable decisions have 
to be made immediately. In our setting, this means that intervals arrive in order of their startpoint, 
including the information of their endpoint, and a color has to be assigned to them immediately. 

The problem of finding a proper coloring (i. e., a coloring where no two intersecting intervals 
have the same color) of intervals in an online setting has found considerable interest lfT5lfTTI . In these 
works, the objective is to use a minimum number of colors. In contrast, we consider a fixed number 
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of colors and the minimization of the imbalance. We show that in contrast to the offline scenario, here 
the imbalance can become arbitrarily large. 

► Theorem 8. In online MINIMUM IMBALANCE INTERVAL /c-COLORING, the imbalance is 
unbounded. 

Proof. We first consider the case k = 2 with colors and "—1". Denote the signed imbalance 
simb(a;) to be the sum of the colors of the intervals containing x. Note that imb(a;) = | simb(a;)|. 

In the following, we outline how a sequence of intervals can be constructed such that no online 
algorithm can yield a bounded imbalance. Initially, simb = 0. Set L — [0, 1] and R = [2, 3]. Let 
Li, Ri, L r , and R r denote the start- and endpoints of the current L and R, respectively. Repeat the 
following steps. 

Present the interval [(Lg + L r ) /2, (Rp + R r ) /2] to the online algorithm. 
If it chooses color +1, set R [R t , (R e + R r ) /2], else R <- [(R e + R r )/2, R r ], 
_ Set L ^ [(L e + L r )/2,L r }. 

This sequence of intervals is legal, since the startpoints increase strictly monotonously. In each 
repetition, if the algorithm chooses color +1, the signed imbalances in L and R increase by one. If 
the algorithm chooses —1, the signed imbalance decreases by one in L and remains unchanged in R, 
i. e., the difference of the signed imbalance in L and R increases. Therefore, the signed imbalance 
diverges in L or R. Since the imbalance is the absolute value of the signed imbalance, it becomes 
unbounded. 

The construction easily generalizes to k > 2 colors. We only track two arbitrary colors, and 
whenever the algorithm assigns an untracked color to an interval, we present the same interval (with 
a slightly increased startpoint) again, forcing it to eventually assign a tracked color or to produce 
unbounded imbalance. -4 

3 Hardness of Generalizations 

We consider several generalizations of MINIMUM IMBALANCE INTERVAL fc-COLORlNG and show 
that they are NP-hard. Note that the hardness results of Biedl et al. [6| do not apply to the problems 
we consider here. 

(/-Dimensional Boxes. Gyarfas and Lehel lfT3l suggest to examine d-dimensional boxes as 
generalizations of intervals for coloring problems. The problem MINIMUM IMBALANCE d-Box 
fc-COLORlNG has as input an integer k and a set I = {Ii, . . . , /„} of n rf-dimensional boxes 

h = {[ii,i,n,i], [£i,2,n,2],- ■ ■ , [h,d,ri,d]) for 1 < i < n. 

For every point x = (xi, . . . , Xd), let S(x) be the set of boxes that include x, i. e., S(x) contains 
all the elements Ii such that ^ ,• < Xj < r^j for all 1 < j < d. For a coloring \ : X — > K, a color i, 
and a point x, let Ci(x) be the number of boxes in S(x) of color i. With the analog definition of 
imbalance imb(x) and balance for c?-dimensional boxes, the problem statement becomes: 

Minimum Imbalance d-Box fc-COLORiNG 
Instance: A set I of d-dimensional boxes. 
Task: Find a k -coloring x with minimal imb(x). 

First note that, unlike for the case d = 1, a balanced coloring may not exist: already for three 
rectangles, some instances require imbalance two. Hence, we also have a related decision problem: 

Balanced d-Box fc-CoLORiNG 
Instance: A set I of d-dimensional boxes. 
Question: Is there a balanced fc-coloring x? 
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(a) Gadget for clause Cj = (a;,, yj, 
Figure 3 The gadgets of the reduction. 




(b) Gadget for crossing chains V and W 



We show that for all d > 2 and k > 2, it is NP-complete to decide BALANCED e?-Box k- 
COLORING. This clearly implies NP-hardness of MINIMUM IMBALANCE d-Box fc-COLORlNG. 

► Theorem 9. BALANCED d-Box fc-COLORlNG is NP-complete for any d>2 and any k > 2. 

We will reduce from Not-All-Equal 3S AT (NAE-3S AT) QU. Note that the classic definition 
of NAE-3SAT |12| allows negated variables. However, this is not needed to make the problem 
NP-complete 11181 . Thus, in the sequel, we will assume that all variables occur only non-negatedly. 

Not-All-Equal 3SAT (NAE-3SAT) 

Instance: A Boolean formula with clauses C\, . . . , C m , each having at most 3 variables. 
Question: Is there a truth assignment such that in every clause, not all variables have the 
same value? 

We first consider k — 2 and then generalize to arbitrary k. We present the gadgets of the reduction, 
then show how they are combined together, and conclude by proving correctness. 

For each clause C\ = (xi,yi, zi), we construct a clause gadget comprised of three rectangles (see 
Figure[3^). Note that all three rectangles overlap in region Ai, and only there. Then we also construct 
a separate rectangle fj for every variable. Finally, we connect each rj to all rectangles that appear in 
a clause gadget, and correspond to the same variable as Tj . We do this by a chain with odd number of 
rectangles. This ensures that in any balanced 2-coloring, rj and the corresponding rectangle in the 
clause gadget have the same color. If two chains need to cross, we introduce a crossing gadget as seen 
in Figure [3]). Three rectangles are relevant for the crossing of two chains V and W. The first is Vi 
and contains areas qo, qy, and q^, the second is V%, containing qy, q^, and q^. Both V\ and V2 belong 
to chain V. The last rectangle contains areas q\ , q± and q$ and belongs to chain W. Note that the 
crossing does not induce any dependencies on the colorings between chains V and W. See Figure [4] 
for a construction of an instance for BALANCED 2-Box 2-COLORING. 

Observe that the above construction only requires a number of rectangles polynomial in the size 
of the NAE-3SAT instance. 

► Lemma 10. Balanced d-Box 2-Coloring is NP-complete for any d>2. 

Proof. The problem is in NP, since feasibility of a color assignment can be checked in polynomial 
time. For NP-hardness, we show that a NAE-3SAT instance is satisfiable if and only if the answer 
to the corresponding BALANCED 2-Box 2-COLORING instance is "yes". This also implies NP- 
completeness for every d > 2 by taking intervals of length in higher dimensions. 

(=>) Assume that there is a satisfying assignment of the NAE-3SAT instance. Then, color the 
rectangles rj according to the truth values of their corresponding variables. This coloring can be 
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Figure 4 Example for NAE-3SAT instance (xi, X2, X3), (X3, X4, xs), (xi, X5, xe). 

easily extended to all the rectangles by alternatively coloring rectangles along a chain (and crossings) 
starting from each rj and ending at a clause gadget. It remains to show that imb(a;) < 1 holds for 
every point x E Ai for all 1 < i < m. Consider A{ corresponding to clause Ci — (xi,yi,Zi). The 
three rectangles that intersect at Ai have the colors corresponding to the truth values of their variables 
Xi,yi, and Zi in the solution of NAE-3SAT. Since the three variables do not have all the same truth 
value, the three rectangles cannot have all the same color, and imb(a;) < 1. 

(-4=) Assume that we have a balanced 2-coloring for the constructed BALANCED 2-Box 2- 
COLORING instance. Consider only the clause gadgets. We have already observed that rectangles 
that correspond to the same variable and appear in clause gadgets must have the same color. We can 
assign the truth values of the variables according to the colors in the corresponding rectangles. Since 
in no Ai all three rectangles have the same color, in no Ci all three variables have the same truth 
value, yielding a feasible solution for NAE-3SAT. 

Proof Of Theorem HI First apply the construction for BALANCED 2-BOX 2-COLORING and call 
its rectangles reduction rectangles. Then add k — 2 additional rectangles that fully contain the 
construction and all intersect at least in one point outside the construction; these are called cover 
rectangles. By the latter property, cover rectangles must have distinct colors in any balanced coloring. 
Observe that each reduction rectangle contains some point that does not intersect with other reduction 
rectangles but only with all the cover rectangles. This implies that the reduction rectangles have 
available only the two colors not used by the cover rectangles. We conclude that the problem of 
fc-coloring the constructed instance is equivalent to the problem of 2-coloring only the reduction 
rectangles. -4 

Further Generalizations. The weighted version, where intervals have weights and the weighted 
imbalance is to be minimized, is NP-complete by reduction from PARTITION. Furthermore, the 
variant with multiple intervals [13] is NP-complete by reduction from NAE-3S AT. Both hardness 
results generalize to higher dimensions. The proofs can be found in Appendix [A] 

4 Open Questions 

h We have given a polynomial time algorithm for fc-coloring hypergraphs with the consecutive-ones 
property, i. e., a special case of a totally unimodular incidence matrix. It would be interesting to 
generalize to arbitrary totally unimodular incidence matrices. 

For arcs of a circle, we have shown how to find a coloring with imbalance at most two in 
polynomial time, but it is not clear how to find an optimal one. 
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h It remains open how large the imbalance can become for d-dimensional boxes, and whether we 
can find polynomial-time approximations for it. We were not able to find an instance requiring an 
imbalance greater than 2 for the 2-dimensional case. 
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A Appendix 

We consider further generalizations that can be shown to be NP-complete. 

A.1 Weighted Version 

We consider the variant where each interval / el has a weight wi S N, and we want to fc-color 1 
as evenly as possible. For a coloring \ '■ % — > K, let Wi(x) denote the total weight of the intervals 
containing x that have color i. Then the weighed imbalance is imb(x) = max^ \wi(x) — Wj(x)\, 
which we seek to minimize. 

Even for k = 2, it is NP-complete to decide if the minimum weighted imbalance of I is zero. 
This also implies NP-hardness of the minimization problem. We will reduce from PARTITION fl2l : 
Given a set A — {a±, 02, . . . ,a n } and a cost function s : A — > N, is there a subset A' C A such 
that J2aeA' s ( a ) ~ J2aeA\A' s ( a )? The reduction is straightforward: For any given instance of 
Partition of size n, construct n identical intervals [£, r] for arbitrary I < r, with each interval Ii, 
for 1 < i < n, having weight s(cti). This instance has an exactly balanced color assignment if and 
only if the answer to the corresponding PARTITION instance is "yes". 

Note that the above reduction is for 1-dimensional problems, but implies the NP-completeness of 
higher-dimensional problems, too. 

A.2 Multiple intervals 

Another generalization suggested by Gyarfas and Lehel lfT3ll is multiple intervals, where specific 
subsets of non-intersecting intervals must receive the same color. This variant is also NP-complete. 
Given a NAE-3SAT instance where no negations are allowed, for every clause Ci construct three 
intervals corresponding to the three variables of the clause. All these three intervals have the same 
startpoints £i and endpoints and £i > r^-i. Finally, for every variable, pack all the corresponding 
constructed intervals into a subset that enforces that they receive the same color. This is legal, since 
different intervals for the same variable are disjoint, and it can be easily seen that the multiple intervals 
instance has a balanced coloring if and only if the corresponding NAE-3SAT instance has a satisfying 
assignment. 

Again, the hardness generalizes to higher dimensions. 



